Categories

Versions

You are viewing the RapidMiner Studio documentation for version 10.2 - Check here for latest version

Unescape HTML Document (Web Mining)

Synopsis

Decodes HTML escape sequences contained in a document.

Description

This operator decodes HTML escape sequences. Any removal of HTML tags should have taken place previously, since unescaping might interfere HTML structure.

Input

  • document

    The document port.

Output

  • document

    The document port.